Nonvolatile semiconductor memory device with advanced multi-page program operation

ABSTRACT

A nonvolatile semiconductor memory device for an efficient program of multilevel data includes a memory cell array having a plurality of banks and a cache block corresponding to each of the plurality of banks. The cache block has a predetermined data storage capacity. A page buffer is included which corresponds to each of the plurality of banks. A programming circuit programs all of the plurality of banks except a last of said banks with page data. The page data is loaded through each page buffer and programmed into each cache block such that when page data for the last bank is loaded into the page buffer, the loaded page data and the page data programmed into the respective cache blocks are programmed into respective corresponding banks. Accordingly, the time taken in programming can be reduced without increasing a unit of program in a multilevel flash memory, thereby improving performance in a multilevel program of a nonvolatile semiconductor memory device.

CROSS-REFERENCE TO RELATED APPLICATION

This U.S. non-provisional patent application claims priority under 35 U.S.C. §119 of Korean Patent Application 10-2007-0012039 filed on Feb. 6, 2007, the entire contents of which are hereby incorporated by reference in its entirety.

BACKGROUND OF THE INVENTION

1. Field of the Invention

Embodiments of the invention relate to nonvolatile semiconductor memories. More particularly, embodiments of the invention relate to a multilevel nonvolatile semiconductor memory capable of storing data of two or more bits in one memory cell.

2. Description of the Related Art

A recently rapid development of information processing apparatuses such as a computer, etc. tends to bring about a high-speed operation and large capacity in semiconductor memory devices employed as main components of the information processing apparatuses.

Typically a semiconductor memory device may be largely classified as a volatile semiconductor memory device and a nonvolatile semiconductor memory device. The volatile semiconductor memory device may be subdivided into a dynamic random access memory and a static random access memory. The volatile semiconductor memory device is fast in read and write speed, but has a shortcoming that contents stored in memory cells are lost when an external power supply is cut off. Conversely, nonvolatile semiconductor memory devices may be classified as mask read only memory (MROM), programmable read only memory (PROM), erasable programmable read only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), etc.

Such nonvolatile semiconductor memory devices have been typically used to store data even without external power. However, in the MROM, PROM, and EPROM, erase and write (or program) operations can not be executed through the electronic system itself. Thus making it difficult to erase or re-program programmed-contents in an on-board state. In an EEPROM, electrical erase and write operations are available through the system itself. This allows EEPROMs to be used as system program storage devices or auxiliary storage devices. Various types of electronic devices controlled by a computer or micro-processor require the development of high density electrically erasable and programmable EEPROMs. A data storage device used in, for example, a digital camera, must be compact, have a relatively higher density and higher performance characteristics.

A NAND-type flash EEPROM having a flash erase function appearing as the design and fabrication technique of EEPROM has been advanced which has a high integration as compared to a general EEPROM and is very advantageous to an application to a large-capacity auxiliary memory. The flash EEPROM is classified as NAND, NOR and AND types, depending upon what kinds of types in unit memory cell arrays are configured. It is well known that NAND type memory has high integration as compared to NOR or AND type.

A configuration of a general nonvolatile semiconductor memory device and fabrication sections of memory cells in a memory cell array are disclosed in, for example, U.S. Pat. No. 6,295,227 (“the '227 patent”) published on Sep. 25, 2001. As disclosed in the '227 patent, a NAND type EEPROM may be comprised of a data input/output buffer, a row decoder for selecting word lines, a column decoder, a column gate, a page buffer for sensing and storing input/output data of memory cell transistors, a booster circuit for generating a boosted voltage, a control circuit for controlling operation of a memory device, and a memory cell array. The memory cell array includes word lines that intersect bit lines. Each memory cell transistor of a NAND cell unit (or cell string) stores data. The word lines control gates of the selected transistors and the memory cell transistors of the NAND cell unit.

A NAND cell unit is formed in a p-type well formed on an n type well. The NAND cell unit constituting a unit of one string is defined by a first selection transistor SST whose drain is connected to a bit line, a second selection transistor ST whose source is connected to a common source line, and 16 or 32 memory transistors whose channels are connected in series between a source of the first selection transistor SST and a drain of the second selection transistor ST. Each memory cell transistor constituting a NAND cell unit has a floating gate formed by interposing a gate oxide on a channel region between a source and a drain, and a control gate formed on the floating gate FG through an interlayer insulation layer. Charge functioning as program data through a program voltage applied to the control gate CG is gathered in floating gate FG.

The erase and program (or write) operations are performed by using F-N tunneling current. For example, in order to erase a memory cell, a very high potential is applied to a substrate and a low potential is applied to control gate CG. The potential determined by a coupling ratio of a capacitance between CG and FG and a capacitance between FG and the substrate is applied to the floating gate FG. When a potential difference between a floating gate voltage Vfg and a substrate voltage Vsub is greater than a potential difference creating the F-N tunneling, electrons gathered in the floating gate FG move to the substrate. This lowers a threshold voltage Vt of a memory cell transistor. Voltage Vt is sufficiently lowered and 0 V is applied to the control gate CG and the source. When an appropriate level of voltage is applied to the drain, current flows. This is commonly referred to as “ERASED” and is generally represented as logic ‘1’.

In order to write to a memory cell, 0V is applied to the source and drain and a very high voltage is applied to the control gate CG. An inversion layer is formed in channel region and the source and drain both have a potential of 0V. When a potential difference between Vchannel (0 V) and Vfg, which is determined on the basis of the coupling ratio of the capacitance between the control gate CG and the floating gate FG and the capacitance between the floating gate FG and the channel region (of the substrate), becomes enough to create F-N tunneling, electrons move from the channel region to the floating gate FG. In this manner, voltage Vt increases and when a predetermined level of voltage is applied to the CG, 0V is applied to the source, and an appropriate level of voltage is applied to the drain. This prevents current flow and is referred to as “PROGRAMMED” indicated as logic ‘0’.

In the memory cell array, a page indicates memory cell transistors in which control gates are connected in common to a particular word line. Multiple pages include a plurality of memory cell transistors defining a cell block. A cell block unit generally includes one or a plurality of cell strings per bit line. The NAND flash memory has a page program mode to accommodate high speed programming. A page program operation is classified as a data loading operation and program operation. The data loading operation sequentially latches and stores, in data registers, data of a byte magnitude from input/output terminals. The data registers are referred to as a page buffer and correspond to the number of bit lines. The program operation includes simultaneously writing the data stored in the data registers to memory transistors associated with a selected word line through selected bit lines.

In the NAND-type EEPROM described above, read operation and program (write) operation are generally performed by a page unit. The erase operation is performed by a block unit. In particular, electron movement between a channel and a FG of the memory cell transistor appears only in the program and erase operations. In the read operation, data stored in a memory cell transistor is just read without damaging the data after the program and erase operations have been performed. In the read operation, a predetermined voltage (typically, a ground voltage) is applied to CG of a selected memory cell transistor and a voltage (typically, a read voltage) higher than the predetermined voltage is applied to CG of a non-selected memory cell transistor. Current flows or does not flow in a corresponding bit line according to a program state of the selected memory cell transistor. When a threshold voltage of the programmed memory cell is higher than a reference value under a predetermined voltage condition, the memory cell is an off-cell, thereby charging a corresponding bit line to a high voltage level. Conversely, when a threshold voltage of the programmed memory cell is lower than a reference value, the memory cell is an on-cell, and a corresponding bit line is discharged to a low level. A state of such a bit line is read out as ‘0’ or ‘1’ through a sense amplifier.

Increases in memory capacity have resulted in multilevel cell (MLC) technology in which data of 2 bits (i.e. four states) or more are programmed or stored in a single memory cell of, for example, a NAND type EEPROM. A threshold voltage of a floating gate MOS transistor defining one memory cell may have one of the four states by appropriately controlling the number of electrons injected into a floating gate in a multilevel program operating mode. In a multilevel read mode, a voltage which indicates a medium from among threshold voltages of four states, is applied as a primary read reference voltage. This is used to sense whether a selected memory cell belongs to a high state or a low state. Subsequently, two different secondary-read reference voltages are used to sense whether the memory cell has a most significant state among the high states or a least significant state among the low states.

Details for such multilevel NAND cell type flash memory are disclosed in U.S. Pat. No. 6,807,096 issued on Oct. 19, 2004 and U.S. Pat. No. 6,937,510 issued on Aug. 30, 2005 which is provided herein for reference only. A multilevel NAND flash memory has a page program mode for high-speed programs. FIG. 1 provides a multi-page programming method in a multilevel flash memory according to a conventional art. FIG. 2 illustrates timings related to the multi-page program illustrated in FIG. 1. FIG. 3 illustrates a distribution change of threshold voltage generally appearing in a multilevel data program.

Referring to FIG. 1 a method of ‘multi-page program’ is disclosed to simultaneously program four pages of data in order to increase programming speed in a multilevel NAND flash memory having four banks 1, 2, 3 and 4. Page data for the four banks are loaded into corresponding page buffers 60-63 in respective steps S1, S2, S3 and S4. The four banks are simultaneously selected and a program operation is performed into designated pages of respective banks through step S5. This page programming method is primarily used to shorten a program time per page in a multilevel flash memory based on a considerably low programming speed.

FIG. 2 illustrates timing diagrams provided when programming four pages through such multi-page program step sections S1, S2, S3, S4 and S5 related to the loading and programming associated with steps S1, S2, S3, S4 and S5 of FIG. 1. Step S1 includes in order, a time interval C1 receiving a data load command, time interval C2 receiving an address, time interval C3 receiving program data and time interval C4 receiving a program command. Program data is loaded into page buffers 60-63 corresponding to a respective bank in steps S1, S2, S3 and S4. In step S5, blocks 10-13 are selected at the same time to perform programming operations.

FIG. 3 illustrates a distribution change of threshold voltage generally appearing in a multilevel data program. A transverse axis indicates a threshold voltage of a memory cell, and a longitudinal axis indicates the number of programmed memory cells. In a flash memory that stores 2 bits of data in one memory cell, data of respective cells are allocated into mutually different pages. A graph (a) of FIG. 3 denotes a program operation of least significant bit (LSB) data. A voltage stepping of ISPP (Incremental Step Program Pulse) is determined largely in order to speed up the program. Then, a comparatively wide threshold voltage distribution is obtained. This type of programming method is called an “SLC Program”. An LSB-programmed cell has a wider threshold voltage distribution in a step that adjacent cells are programmed, as shown in a graph (b) of FIG. 3, as compared with the graph (a). Graph (c) of FIG. 3 denotes programming operation of most significant bit (MSB) data. Of the cells left in the erase state after the programming of LSB data ‘1’, a cell into which an MSB data ‘0’ is loaded moves to a state of ‘01’ as shown in an arrow AR1. Of the cells into which LSB data ‘0’ is programmed, a cell into which MSB data ‘1’ is loaded moves to a state of ‘10’ as shown in an arrow AR2. Of the cells into which LSB data ‘0’ is programmed, a cell into which MSB data ‘0’ is loaded moves to a state of ‘00’ as shown in an arrow AR3. Accordingly, in the programming step of adjacent cells after the LSB program, the threshold voltage distribution effect becomes wide to compensate for a disturbance and a threshold voltage distribution can be controlled more precisely. In such a programming method, the LSB page program can be obtained earlier by making the ISPP program voltage step greater, but the MSB page program has a considerably lower programming speed as compared with the LSB page program because it needs to precisely control three threshold voltage states. For example, when an LSB page program time tLSB is 200 μs, and an MSB page program time tMSB is 1400 μs where an average page program time becomes tPGM=(tLSB+tMSB)/2=800 μs. As a result, in the multi-page programming method as illustrated in FIG. 1, four pages are programmed at a time such that the average program time is 800 μs/4=200 μs. In the programming method of FIG. 1, a program unit becomes four pages. Consequently, a controller can provide efficient program performance when data for four pages are loaded and programmed at a time. When the program unit becomes large, an overall program time can be shortened with this enhanced program performance. However, the large program unit may have an inefficient aspect in view of managing a file system. It is, therefore, required to find a method to increase a program speed while maintaining a smaller program unit. Accordingly, there is a need for a method to enhance program speed while keeping a relatively smaller program unit in a multilevel flash memory.

SUMMARY OF THE INVENTION

Exemplary embodiments of the present invention are directed to a nonvolatile semiconductor memory device capable of reducing programming time without increasing a unit of program of a multilevel flash memory. In an exemplary embodiment, a nonvolatile semiconductor memory device for performing a programming of multilevel data includes a memory cell array having a plurality of banks, each including a cache block, and a plurality of page buffer. The page buffer corresponds to each of the plurality of banks and is configured to load page data. The cache block corresponds to each of the plurality of banks having a predetermined data storage capacity. The cache block programming the banks, except a last of the plurality of banks, with the page data from the page buffer such that when page data for the last of the plurality of banks is loaded into the page buffer, the loaded page data and the page data programmed into the respective cache blocks are programmed into corresponding banks.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 provides a multi-page programming method in a multilevel flash memory according to a conventional art;

FIG. 2 illustrates timings related to a multi-page program of FIG. 1;

FIG. 3 illustrates a distribution change of threshold voltage generally appearing in a multilevel data program;

FIGS. 4 to 6 illustrate a multi-page program sequence through respective steps in a multilevel flash memory according to some embodiments of the invention; and

FIG. 7 illustrates timings of the multi-page program referred to in FIGS. 4 to 6.

DESCRIPTION OF EMBODIMENTS

The present invention will now be described more fully hereinafter with reference to the accompanying drawings, in which preferred embodiments of the invention are shown. This invention, however, may be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art. In the drawings, like numbers refer to like elements throughout.

According to an embodiment of the invention a cache block having a predetermined data storage capacity is adapted in each bank of a memory cell array to reduce programming time without increasing a unit of program as compared to a conventional program scheme. A programming circuit sequentially performs an operation of loading page data into a corresponding page buffer and an operation of programming the loaded page data into a corresponding cache block in association with rest banks (except a last bank among the banks). When page data for the last bank are loaded in the page buffer, the loaded page data and the page data programmed in the respective cache blocks are programmed at a time in each corresponding bank.

The programming circuit may comprise function blocks necessary for a data loading and execution of program, for example, a data input/output buffer, row decoder, column decoder, column gate, high voltage generating circuit and control circuit, in a nonvolatile semiconductor memory device.

FIGS. 4 to 6 illustrate a most significant bit (MSB) page program sequence through respective steps in a multilevel flash memory. Reference numbers S1, S2, S3, S4, S5, S6, S7, S8 and S9 related to a loading, cache program and MSB program shown in FIG. 7 each correspond to steps S1, S2, S3, S4, S5, S6, S7, S8 and S9 shown in FIGS. 4 to 6. Turning briefly to FIG. 7, step S1 has in order, a time interval receiving a data load command, time interval receiving an address, time interval receiving program data, and time interval receiving a program command. Step S2 includes a time interval of performing an SLC program into a cache block. In steps S2, S4 and S6, operation of programming each cache block with MSB data is performed. In step S8, operation of copying the MSB data programmed in each cache block back to a corresponding page buffer is implemented. In step S9, operation of programming blocks 10-13 of four banks with MSB page data through page buffer is performed. “HP Program Command” shown in FIG. 7 indicates “High Performance Program Command” as a specific program command to perform a high speed program through cache block.

Referring to FIG. 4, an operation of programming a plurality of banks with multilevel data will be described. It is assumed that the number of banks is four. Each of first to fourth banks 1 to 4 includes a plurality of memory blocks, each including a plurality of memory cells, and a cache block. Page buffers 60-63 are provided corresponding to the first to fourth banks 1 to 4. Steps S1-S6 provide procedures of sequentially implementing a high-speed cache program in three banks 1, 2 and 3. Each of the respective blocks 10, 11, and 12 is selected from each bank. The data are loaded in each page buffer 60, 61, 62 and programmed into a corresponding cache block 20-22. The MSB page data programmed into each cache block 20-22 is programmed through an algorithm such as an LSB page program. For example, an SLC program algorithm has a program time of 200 μS per page as shown in step sections S2, S4 and S6 of FIG. 7.

Cache blocks 20-23 each have a capacity to store individually loaded page data and are constructed of memory cells such as multilevel memory cells provided within an associated bank 1-4. Data programmed through cache blocks 20-22 are MSB page data. When a specific cache program command, for example “HP Program Command” is input, a high speed program for selected cache blocks is implemented. The least significant bit (LSB) page data is programmed directly into corresponding memory blocks of a selected bank without passing through cache blocks 20-22. As a result, an LSB page program is performed similar to that disclosed in the conventional art, and an MSB page program is performed as illustrated in FIGS. 4 to 6. By programming the MSB in this manner, program speed is improved without increasing a unit of program.

FIG. 5 illustrates steps of loading MSB data into a fourth page buffer 63 and then copying data programmed into the cache blocks 20-22 back to corresponding page buffers 60-62 via steps S7 and S8. After the MSB data is loaded in fourth page buffer 63 through step S7, a high speed program command is input. The MSB page data individually programmed in cache blocks 20, 21 and 22 of banks 1, 2 and 3 are programmed at a time corresponding to page buffers 60, 61 and 62 for a time interval T1 of FIG. 7. This is referred to as a copy-back operation which is performed through step S8.

FIG. 6 illustrates steps that MSB page data contained in page buffers 60-63 are simultaneously programmed into memory blocks 10-13 respectively through simultaneous operating step S9. This programming algorithm time is about 1400 μS and is needed as shown in time interval T2 of FIG. 7. Thus, the total programming time for four pages with MSB page data is ′3×200 μS (for blocks 1-3)+1400 μS (for block 4)=2000 μS. The average program time of one page taken in the MSB page program is ′tMSB=2000 μS/4=500 μS. The time taken in LSB page program is ′tLSB=200 μS as in the conventional art. Therefore, an average program time taken when one page is programmed becomes ′tPGM=(tLSB+tMSB)/2=350 μS. Accordingly, the average programming time can be reduced in performing a data loading of each page using a cache block adapted inside a semiconductor chip. This programming time reduction is done without increasing a unit of program in a multilevel flash memory as compared with a conventional program scheme, thereby enhancing memory device performance.

Although the present invention has been described in connection with the embodiment of the present invention illustrated in the accompanying drawings, it is not limited thereto. It will be apparent to those skilled in the art that various substitutions, modifications and changes may be made thereto without departing from the scope and spirit of the invention. 

1. A multilevel flash memory for programming multilevel data comprising: a memory cell array including a plurality of banks each having a plurality of memory blocks and a cache block, each of said memory blocks having a plurality of memory cells configured to store multilevel data; a page buffer communicating with said memory array, said page buffer corresponding to each of said plurality of banks within the memory cell array; and a programming circuit for programming said banks except a last bank among said banks with page data, said page data loaded through said page buffer is programmed into each of said cache blocks such that when said page data for the last bank is loaded into the page buffer and said page data programmed into the respective cache blocks are copied back to corresponding page buffers, page data associated with all of said page buffers are programmed into respective corresponding banks.
 2. A programming method in a multilevel flash memory for which a memory cell array is divided into a plurality of banks, the method comprising: preparing a cache block associated with each of said plurality of banks; sequentially programming cache blocks, except a cache block allocated to a last bank of said plurality of banks, with MSB page data sequentially loaded through corresponding page buffers, programming said MSB page data after an LSB page program is performed directly into a selected block without passing through a cache block; copying said MSB page data programmed in said cache blocks back to the corresponding page buffers in response to a predetermined program command after the MSB page data is loaded into the last page buffer; and simultaneously programming corresponding blocks of said banks with said MSB page data contained into all of said page buffers.
 3. The method of claim 2 wherein the predetermined program command is a high performance program command.
 4. A nonvolatile semiconductor memory device programming data including a least significant bit (LSB) page and a most significant bit (MSB) page, the device comprising: a memory cell array divided into N banks; N page buffers, each respectively associated with a corresponding one of the N banks and being configured to load a page of data; and N caches, each respectively associated with a corresponding one of the N banks and being configured to receive data programmed from a corresponding one of the N page buffers, wherein each one N page buffers is further configured to program an LSB page directly to the corresponding one of the N banks without passing through the corresponding one of the N caches, and each one of the 1^(st) through N−1th page buffers is further configured to program an MSB page to the corresponding one of the N banks through the corresponding one of the N caches.
 5. The device of claim 4, wherein the data comprises a plurality of MSB pages, and each one of the N page buffers is further configured to respectively and sequentially load one of the plurality of MSB pages, and program the loaded one of the plurality of MSB pages to the corresponding one of the N caches.
 6. The device of claim 5, wherein each one of the 1^(st) through N−1th caches is further configured to copy back to the corresponding 1^(st) through N−1th page buffers the programmed MSB page after the 1^(st) through N−1th MSB pages are programmed to the 1^(st) through N−1th caches.
 7. The device of claim 6 wherein each one of the 1^(st) through N−1th MSB pages is respectively programmed to the 1^(st) through N−1th caches using single-level cell (SLC) programming.
 8. The device of claim 7, wherein the 1^(st) through Nth MSB pages are simultaneously programmed from the N pages buffers to the N banks using multi-level cell (MLC) programming.
 9. A method of programming multi-level data including N pages of most significant bit (MSB) data to a nonvolatile semiconductor memory device having a memory cell array divided into 1^(st) through Nth banks, the method comprising: sequentially loading an i^(th) page of MSB data to an i^(th) page buffer, and then programming the i^(th) page of MSB data from the i^(th) page buffer to an i^(th) cache uniquely associated with an i^(th) bank, wherein “i” ranges from 1 to N−1, wherein the respective programming of the 1^(st) through N−1th pages of MSB data from the 1^(st) through N−1th page buffers to the 1^(st) through N−1th caches is performed using single-level cell (SLC) programming; loading a Nth page of MSB data to an Nth page buffer; and then, simultaneously and respectively programming the 1^(st) through Nth pages of MSB data from 1^(st) through Nth page buffers to the 1^(st) through Nth banks using multi-level cell (MLC) programming; and after sequentially and respectively loading the 1^(st) through N−1th pages of MSB data to the 1^(st) through N−1th page buffers, and programming the 1^(st) through N−1th pages of MSB data from the 1^(st) through N−1th page buffers to the 1^(st) through N−1th caches, copying back the 1^(st) through N−1th pages of MSB data from the 1^(st) through N−1th caches to the 1^(st) through N−1th page buffers before loading an Nth page of MSB data to an Nth page buffer.
 10. The method of claim 9, wherein the copying back of the 1^(st) through N−1th pages of MSB data from the 1^(st) through N−1th caches to the 1^(st) through N−1th page buffers is performed simultaneously.
 11. A method programming multi-level cell (MLC) data to nonvolatile semiconductor memory having an array of MLC memory cells divided into N banks, wherein each one of the 1^(st) through Nth banks is programmed through at least one of corresponding 1^(st) through Nth page buffers and 1^(st) through Nth caches, the method comprising: preparing a cache block for the 1^(st) through N−1th banks by sequentially programming 1^(st) through N−1th pages of MLC data from the 1^(st) through N−1th page buffers to the 1^(st) through N−1th caches; loading an Nth page of MLC data to the Nth page buffer; and then, simultaneously and respectively programming the 1^(st) through Nth pages of MLC data from the 1^(st) through Nth pages buffers to the 1^(st) through Nth banks; and before simultaneously and respectively programming the 1^(st) through Nth pages of MLC data from the 1^(st) through Nth pages buffers to the 1^(st) through Nth banks, copying back the cache block by respectively copying back the 1^(st) through N−1th pages of MLC data from the 1^(st) though N−1th caches to the 1^(st) through N−1th page buffers.
 12. The method of claim 11, wherein programming of the 1^(st) through N−1th pages of MLC data from the 1^(st) through N−1th page buffer to the 1^(st) through N−1th caches is respectively performed by executing a single-level cell (SLC) program operation.
 13. The method of claim 11, wherein simultaneously and respectively programming the 1^(st) through Nth pages of MLC data from the 1^(st) through Nth pages buffers to the 1^(st) through Nth banks is performed in response to a single high performance command. 